Modeling Query Term Dependencies in Information Retrieval with Markov Random Fields

Authors

  • Donald Metzler
  • W. Bruce Croft
Abstract

This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as evidence. In particular, we make use of features based on occurrences of single terms, ordered phrases, and unordered phrases. We explore full independence, sequential dependence, and full dependence variants of the model. A novel approach is developed to train the model by directly maximizing mean average precision. Our results show that significant improvements are possible by modeling dependencies, especially on larger web collections.
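
As a rough illustration of the three feature types named in the abstract (single terms, ordered phrases, unordered phrases), the following is a minimal sketch of a sequential-dependence style scorer. It is not the authors' implementation. The function names, the Dirichlet smoothing with `mu`, the 0.5 floor on collection counts, the window size, and the default weights are all assumptions chosen for illustration; the abstract says the weights are trained by directly maximizing mean average precision, which this sketch does not do.

```python
import math


def count_ordered(doc, terms):
    """Count contiguous, in-order occurrences of `terms` in the token list `doc`."""
    n = len(terms)
    return sum(1 for i in range(len(doc) - n + 1) if doc[i:i + n] == terms)


def count_unordered(doc, terms, window):
    """Count window start positions whose `window` tokens contain every term.

    This is a simplified stand-in for an unordered-window operator.
    """
    needed = set(terms)
    starts = max(len(doc) - window + 1, 1)
    return sum(1 for i in range(starts) if needed <= set(doc[i:i + window]))


def smoothed_log(tf, doc_len, cf, coll_len, mu):
    """Dirichlet-smoothed log estimate of a feature in a document."""
    # Assumption: floor the collection count at 0.5 so log(0) never occurs.
    background = max(cf, 0.5) / coll_len
    return math.log((tf + mu * background) / (doc_len + mu))


def dependence_score(query, doc, collection,
                     weights=(0.85, 0.10, 0.05), mu=2500.0, window=8):
    """Score `doc` for `query` using term, ordered, and unordered features.

    weights = (term, ordered, unordered); the defaults are illustrative
    placeholders, not values taken from the abstract.
    Setting weights=(1, 0, 0) gives a full-independence (bag-of-words) score.
    """
    w_term, w_ordered, w_unordered = weights
    coll_len = sum(len(d) for d in collection)
    score = 0.0

    # Single-term features.
    for term in query:
        tf = doc.count(term)
        cf = sum(d.count(term) for d in collection)
        score += w_term * smoothed_log(tf, len(doc), cf, coll_len, mu)

    # Sequential dependence: only adjacent query terms are paired.
    for left, right in zip(query, query[1:]):
        pair = [left, right]
        tf_o = count_ordered(doc, pair)
        cf_o = sum(count_ordered(d, pair) for d in collection)
        score += w_ordered * smoothed_log(tf_o, len(doc), cf_o, coll_len, mu)

        tf_u = count_unordered(doc, pair, window)
        cf_u = sum(count_unordered(d, pair, window) for d in collection)
        score += w_unordered * smoothed_log(tf_u, len(doc), cf_u, coll_len, mu)

    return score


# Two toy documents with the same length and the same single-term counts.
d1 = "markov random fields model term dependencies".split()
d2 = "random walks differ from markov chains".split()
collection = [d1, d2]
query = ["markov", "random"]

s1 = dependence_score(query, d1, collection)
s2 = dependence_score(query, d2, collection)
b1 = dependence_score(query, d1, collection, weights=(1.0, 0.0, 0.0))
b2 = dependence_score(query, d2, collection, weights=(1.0, 0.0, 0.0))
```

In this toy setup the bag-of-words scores `b1` and `b2` are identical, because both documents contain each query term once, while `s1` exceeds `s2` because only `d1` contains the exact phrase "markov random". A full-dependence variant would pair every subset of query terms rather than only adjacent ones.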

Similar resources

Beyond Bags of Words: Modeling Implicit User Preferences in Information Retrieval

This paper reports on recent work in the field of information retrieval that attempts to go beyond the overly simplified approach of representing documents and queries as bags of words. Simple models make it difficult to accurately model a user’s information need. The model presented in the paper is based on Markov random fields and allows almost arbitrary features to be encoded. This provides ...

Source Code Retrieval from Large Software Libraries for Automatic Bug Localization

Sisman, Bunyamin Ph.D., Purdue University, December 2013. Source Code Retrieval from Large Software Libraries for Automatic Bug Localization. Major Professor: Avinash C. Kak. This dissertation advances the state-of-the-art in information retrieval (IR) based approaches to automatic bug localization in software. In an IR-based approach, one first creates a search engine using a probabilistic or ...

An Information Retrieval Expansion Model Based on Quasi-Clique

Query expansion is an important technology for improving retrieval performance in information retrieval. Many studies have found that contexts within a query strongly influence its interpretation. In this paper, we propose using the graph mining technique called Quasi-Clique as query context in a Markov network retrieval model. Our approach exploits contextual information mined from the term M...

Improving Query Expansion for Information Retrieval Using Wikipedia

Query expansion (QE) is one of the key technologies to improve retrieval efficiency. Many studies on query expansion with relationships from a single local corpus suffer from two problems that result in low retrieval performance: term relationships are limited, and unlisted query terms have no expansion terms. To address these problems, relationships between terms captured from Wikipedia are superim...

Incorporating Semantic Knowledge with MRF Term Dependency Model in Medical Document Retrieval

Term dependency models are generally better than bag-of-words models, because complete concepts are often represented by multiple terms. However, without semantic knowledge, such models may introduce many false dependencies among terms, especially when the document collection is small and homogeneous (e.g., newswire documents, medical documents). The main contribution of this work is to incorporate...

Publication date: 2005